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ABSTRACT 

We have tested Very Small Array (VSA) observations of three regions of sky for the 
presence of non-Gaussianity, using high-order cumulants, Minkowski functionals, a 
, wavelet-based test and a Bayesian joint power spectrum/non-Gaussianity analysis. We 

find the data from two regions to be consistent with Gaussianity. In the third region, 
we obtain a 96.7% detection of non-Gaussianity using the wavelet test. We perform 
K> , simulations to characterise the tests, and conclude that this is consistent with expected 

; i ■ residual point source contamination. There is therefore no evidence that this detection 

d ' is of cosmological origin. Our simulations show that the tests would be sensitive to 

any residual point sources above the data's source subtraction level of 20 mJy The 
tests are also sensitive to cosmic string networks at an rms fluctuation level of 105 fiK 
(i.e. equivalent to the best-fit observed value). They are not sensitive to string- induced 
fluctuations if an equal rms of Gaussian CDM fluctuations is added, thereby reducing 
the fluctuations due to the strings network to 74 [iK rms . We especially highlight the 
usefulness of non-Gaussianity testing in eliminating systematic effects from our data. 

Key words: cosmology: observations - methods: data analysis - cosmic microwave 
background 



1 INTRODUCTION 

The search for non-Gaussianity in the anisotropies of the 
cosmic microwave background (CMB) radiation is of ma- 
jor importance to modern cosmology. Different fundamen- 
tal theories of structure formation predict markedly differ- 
ent non-Gaussian CMB signatures, making the quantifica- 
tion of any such signatures a powerful discriminator be- 
tween these possibilities. Measurements of the CMB angular 
power spectrum have now ruled out the classic non-Gaussian 
example of topological defects as the primary method of 
generating CMB anisotropy, although the possibility of a 



sub-dominant contribution remains. Recent work on dif- 
ferent varieties of the inflationary paradigm (e.g. [ Peeblesl 
ll999UWang fc Kamionkowskill200(J: iBartolo fc Liddldl2002ft 
has shown that different forms of inflation can also be dis- 
tinguished by the characterisation of any non-Gaussianity 
present in the CMB. 

There are other very good reasons to search for non- 
Gaussianity in the CMB. Much of CMB cosmology centres 
on the accurate determination of the angular power spec- 
trum. If the CMB fluctuations form a Gaussian random 
field, then the power spectrum completely characterises its 
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statistical properties. Therefore, the detection of any non- 
Gaussianity would indicate that there is further information 
to be gathered. It is also the case that, with one exception 
jRocha et alJl200lT) . all power spectrum analyses of CMB 
data assume the data to be Gaussian. As with any assump- 
tion, this must be tested if we are to have confidence in the 
results. 

There is also the consideration of contamination of 
CMB data sets. Extragalactic sources, Sunyaev-Z'eldovich 
decrements and the various Galactic foregrounds all play 
their part in contaminating the CMB. Moreover, they 
all introduce non-Gaussianity into the data, making non- 
Gaussianity testing an excellent way to identify their pres- 
ence. Furthermore, data contamination can occur other than 
by the telescope seeing something unwanted in the sky. Sys- 
tematic effects are invariably an is sue. The detection of non- 
Gaussianity in the COB E data bytF crrcira ct al. ( 199q) was 
traced to such an effect feandav et all2000l) and, in general, 
non-Gaussianity testing can be extremely useful in identify- 
ing and removing systematics. 

As a consequence of the many-fold importance of non- 
Gaussianity testing to CMB cosmology, many different 
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These analyses have shown the 
practical application of non-Gaussianity testing, particularly 
the importance of robust Monte Carlo simulation for deter- 
mining the significance level of any result. 

In this paper we present the results of applying a se- 
lection of non-Gaussianity tests to the VSA da ta set first 
presented in i ts entirety in iGrainee et ail (120031) (Paper V 
hereafter) and lSlosar et alJ 200^(Raper VI hereafter). Re- 
sults from the VSA in i ts compact configur a tion have al- 
ready been present ed in Watson et alj <|2003|). |Taylp r et al] 
]2003ft . lScott et al.l J2003I) . and lRubino-Martin et al.M20o'3Tr 
(hereafter Papers I - IV respectively). 



2 INTERFEROMETRIC OBSERVATIONS OF 
THE CMB 

2.1 The VSA observations 

The VSA is a 14-element interferometric telescope operating 
with a 1.5 GHz observing bandwidth, centred at 34 GHz for 
the data analysed in this paper. It has been used in two dis- 
tinct configurations, the compact array (covering the mul- 
tipole range I ~ 150 - 900 with a 34 GHz primary beam 
FWHM of 4.6°) and the extended array (extending to mul- 
tipoles as high as I ~ 1400 with a 34 GHz primary beam 
FWHM of 2.0°). 

The VSA observing strategy has focussed so far on mak- 
ing deep observations on 3 relatively small patches of sky 
(named the VSA1, VSA2 and VSA3 mosaics). These cover 
a total of 101 square degrees of sky, with each region com- 
prising the data from either five or six separate pointing 
centres (three extended array fields and either two or three 



compact array fields). These individual fields are coherently 
mosaiced to produce maps and power spectra. 

Using a combin ation of a 15 GHz blind survey 
JWaldram et al.ll2003l) and regular monitoring observations 
made at 34 GHz by the VSA's dedicated source subtrac- 
tion interferometer, we have subtracted all sources with flux 
densities 5*34 ghz > 20 mjy (or, in the case of the compact 
array fields, 80 mjy) from our data. 

The data set analysed in this paper is described more 
fully in Paper V. 

2.2 The VSA maps 

Some of our chosen tests operate on CMB maps produced 
from our data. We use a maximum-entrop y method (MEM) 
based on that of iMaisinger et al] <ll997t) to produce mo- 
saiced maps of the VSA data. Maisinger et al. show that 
this method produces more accurate reconstructions than 
the sim pler Wiener filter appr oach. Furthermore, it has been 
shown iMaisinger et al.l ll998'l that this method accurately 
reconstructs non-Gaussian features in CMB maps, such as 
the prominent hot spots produced by cosmic strings. This is 
not true of Wiener filtering. 

The MEM maps of the three VSA mosaics are shown in 
Fig. Well outside the FWHM of the primary beams the 
data contain negligible useful information about the CMB 
structure. Therefore, the MEM maps show no features to- 
wards their edges. As these featureless regions contain no 
useful CMB information, we select only the central region of 
our maps for analysis. In all cases these regions are 128 x 128 
pixels, corresponding to 7.36° x 7.36°. An example of this 
is shown in Fig. (bottom right) . It can be seen that the 
region size corresponds approximately to the combined area 
within the FWHM of the primary beams. 

It is noted that this choice of pixel size significantly 
oversamples the VSA data set considered here. For exam- 
ple, the compact array synthesised beam has a FWHM of 
approximately 30'. These maps therefore contain of order a 
few hundred independent elements each. 



2.3 Particular issues in detecting non-Gaussianity 
in interferometer data 

There are a number of characteristics of interferometer ob- 
servations that require careful consideration when testing 
for non-Gaussianity. 

First, an interferometer does not measure the bright- 
ness distribution of the sky directly. Rather, it samples (re- 
gions of) the visibility plane, which is the Fourier transform 
of the sky brightness distribution convolved with the aper- 
ture illumination function (itself the Fourier transform of 
the primary beam). It is therefore most natural to test for 
non-Gaussianity in the visibility plane, where receiver noise 
is Gaussian and uncorrelated between samples, and the sig- 
nal is uncorrelated on scales larger than the aperture illu- 
mination function. The interferometer's intrinsic sensitivity 
to specific Fourier modes can also be exploited, for exam- 
ple, in searching for a non-Gaussian signature dominant only 
on certain angular scales. Targeting these scales essentially 
eliminates the contribution from non-overlapping Gaussian 
CMB fluctuations on other scales, which would otherwise 
mask the non-Gaussian signature. 
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Figure 1. Top left, top right and bottom left: The VSA1, VSA2 and VSA3 mosaics. The maps are produced 
using a MEM algorithm. The circles denote the FWHM of the primary beams of each of the individual pointings. 
Bottom right: a simulated Gaussian CMB realisation corresponding to the VSA1 mosaic. The box shows the central 
128x128 pixel region that is analysed in each case. The box is chosen to enclose the FWHMs but exclude the area 
of low sensitivity towards the edge of the whole map. In all cases, a central region 128x128 pixels, corresponding to 
7.36° x 7.36°. 



By contrast, images produced from interferometer data 
have a number of features which tend to complicate any 
statistical analysis performed on them. 



• The noise contributions between the individual pixels 
of an interferometer map are correlated over a wide range 
of angular scales. 

• Interferometers sample the Fourier plane incompletely. 
This leads to interferometer maps being convolved by a so- 
called 'dirty beam', with its associated sidelobes. This effect 
is not very significant for the VSA, as it has good sampling 
coverage of the visibility plane. It is also noted that the 
MEM algorithm we employ attempts to deconvolve the maps 
it produces. 

• The combination of data obtained using different pri- 
mary beams and array configurations (as in the case of the 
VSA) leads to significant variation in temperature sensitiv- 
ity and angular resolution across the map. This means that 



the effective probability density function (PDF) from which 
the map pixels are drawn will be non-stationary. 

Nevertheless, all of these complications can be resolved by 
the use of appropriate Monte Carlo simulations. 



3 CHOICE OF NON-GAUSSIANITY TEST 

There are an infinite number of ways for a PDF to be non- 
Gaussian but any given non-Gaussianity analysis is sensitive 
only to a subset of these. It is therefore prudent to apply a 
number of different analyses. In the following subsections, 
we describe the analyses employed in this paper. 

For the sake of clarity, in this paper we adopt the follow- 
ing terminology. We employ four different non-Gaussianity 
analyses. Each of these analyses consists of a number of indi- 
vidual statistics, for which we assess significance levels. For 
the map plane analyses, we then form a number of differ- 
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ent combined tests. Each test consists of a set of individual 
statistics. A test function is calculated from this set of statis- 
tics and used to assess a combined significance level. 

3.1 Analysis in the visibility plane 

We perform a Bayesian joint estimation of non-Gaussianity 
and the power spectrum of CMB anisotropics, before 
marginalising over the power spectrum to give an estimate 
of the non-Gaussianity of the data. We use the general form 
of the likelihood, derived from the eig enfunctions o f a lin - 
ear harmonic oscillator, as developed bv lRocha et aI.H200ll) . 
This distribution takes the form of a Gaussian multiplied 
by the square of a (finite) series of Hermite polynomials, 
where the coefficients a n are used as non-Gaussian estima- 
tors. These amplitudes a n can be written as series of cumu- 
lants iContaldi et al.lll999T) . In particular, they can be set 
to zero independently without mathematical inconsistency. 
Furthermore, perturbatively (that is when the cumulants are 
"small" in a suitable sense), the amplitudes a„ are propor- 
tional to the n th order cumulant. We modify the standard 
Gaussian maximum lik elihood method for a nalysing inter- 
ferometer observations (iHobson fc Maisingedl2002T) with our 
new distribution. Instead of assuming the simple Gaussian 
form for the p robability d istribution of each signal-to-noise 
eigenmode £i jBond et a.1.111998^) . we consider the more gen- 
eral situation in which all a n are set to zero, except for the 
real part of one of them. To illustrate, we consider the 0:3 
likelihood function 



P(x) = 



2ttoq 



. a 3 / x 
ao H — t==H 3 — = — 



(1) 



with ao = v/1 — a§ to ensure that P(x) is normalised to 
unity. 

We approximate the generalisation of this distribution 
to the multidimensional case, in the signal-to-noise basis, by 
simply taking the product of the individual one-dimensional 
distributions. Here the data are the signal-to-noise eigen- 
modes, £i, which are uncorrelated and have a covariance ma- 
trix given by (£<£j) = (l+a k Xi)Sij, where \i is the eigenvalue 
corresponding to the eigenmode £j and at = (^ 2 CV/(27r))fc, 
i.e. cifc is the average of value of 1 2 Ci/{2-k) in the fcth spectral 
bin. 

Thus, when considering the power spectrum in the fcth 
spectral bin, we adopt the likelihood function 



ao 



H, 



6 



, (2) 



where Cj = 1 + a^Xi. The a-j, could in principle depend on 
I, but for simplicity we have dropped this dependence. The 
cases of this test considered in this paper are for a single 
non-zero a n with n = 3 — 20. Some examples of such non- 
Gaussian PDFs are shown in Fig. [5] 

It is important to note that this method does not ensure 
that the higher order moments are uncorrelated and so this 
approximation will be invalid in the strong non-Gaussian 
limit. Such a case, however, would be straightforward to 
detect using other methods and this is thus not a significant 
problem. 

We note that the phase mapping technique of 



IChiang et al.l (|2002) is also well suited to analysing inter- 
ferometer data. For our data set, however, the individual 
visibility points have small signal-to-noise ratios. For exam- 
ple, each point on the VSA power spectrum (see Paper V) is 
the result of a likelihood analysis of several thousand binned 
visibility points. Therefore, such a phase mapping technique 
would tend to measure information about the VSA's noise 
properties rather than the CMB. 

3.2 Analysis in the map plane 

For analysis of possible non-Gaussianity in the map plane, 
we calculate three distinct sets of statistics. 

Map cumulants. The higher-order (K3 and above) cu- 
mulants of a Gaus sian distribution are all zero, (see e.g. 
iFerreira et alll997l) therefore, the cumulants of a set of data 
can conveniently be used to determine whether that data set 
is drawn from a Gaussian distribution or not. We estimate 
the higher-order (ks — Kg) cumulants of the pixels in our 
maps. 

Minkowski junctionals. For a Euclidean 2-D image such 
as our maps, the three Minkowski functionals are the sur- 
face area, peri meter and Euler char acteristic of an excursion 
region (see e.g. lHobson et alJll99Sh . This region is taken as 
the part of the map above a certain threshold temperature, 
T. The functionals are therefore functions of this variable. 
We calculate the 3 Minkowski functionals of our maps at 51 
different value of T between ±6 a (where a is the rms of 
the map pixels). The individual statistics are therefore the 
functional values at each of these 51 values of T. 

Wavelet cumulants. This analysis is not actually calcu- 
lated in the map plane, but the map provides the st art- 
ing point. Following the method of iHobson et alJ lll999l) we 
perform a wavelet transform on our map. We then esti- 
mate the skewness and kurtosis of subsets of the wavelet 
co-efficients using fc-statistics. There are a total of 21 dif- 
ferent subsets, corresponding to wavelets of given physical 
scales. This makes the wavelet test ideal for detecting a given 
non-Gaussian feature repeated many times in a map. The 
individual statistics are therefore the 21 separate skewness 
and 21 separate kurtosis values. These are repeated for each 
of the nine wavelet bases considered by Hobson et al, giving 
9 x 42 wavelet statistics in total. 



3.3 Evaluation of individual confidence levels 

The joint estimation analysis uses a Bayesian framework and 
so calculates a full marginalised posterior probability distri- 
bution of the non-Gaussian parameter for each statistic. For 
the map plane statistics, however, we adopt a frequentist ap- 
proach and so must define the Gaussian confidence regions 
on our statistics in terms of appropriately simulated obser- 
vations of Gaussian sky realisations. To do this, we simulate 
a large number (1000 for the results given in this paper) of 
equivalent Gaussian realisations (EGR), calculate the sta- 
tistical values for each one, and use these values to define 
the Gaussian confidence regions on each individual statis- 
tic. This also allows us to account for the complications of 
analysing interferometer maps. 

For the purposes of this paper, we produce our EGR as 
follows: 
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Figure 2. Some fairly extreme examples of the non-Gaussian PDFs used in the joint estimation test. Shown are 
PDFs with non-Gaussian parameter values of 03 = 0.5 (top left), 04 = 0.3 (top right), aio = —0.4 (bottom left) 
and 017 = 0.25 (bottom right). 



• The real and imaginary parts of each Fourier mode a; m 
are drawn independently from a Gaussian random distribu- 
tion. The variance of this PDF is determined by an underly- 
ing power spectrum, given by a CDM model using the VSA 
best-fit parameters for this data set (see Paper VI). 

• The modes are inverse Fourier transformed to produce 
a Gaussian CMB realisation. 

• For each field in the simulated mosaic, the realisation is 
multiplied by the appropriate primary beam for that field, 
and Fourier transformed to produce the visibility planes for 
each field. 

• The visibility planes are sampled using a real VSA ob- 
servation as a template. Gaussian noise is then added to 
each sampled visibility, using the same rms as given in the 
VSA template file. 

• A map is then made from this simulated mosaiced ob- 
servation, using the same software pipeline that is used to 
produce the real maps. 

The EGR are therefore simulated observations of a 
Gaussian sky with a realistic power spectrum, sampling the 
visibility plane correctly for a given observation and with 
correct noise contributions. 



3.4 Evaluation of combined confidence levels 

We have detailed methods of obtaining significance levels for 
all of the individual statistics we are considering. However, 
we can place stronger limits on non-Gaussianity by combin- 
ing these statistics in varying ways. We note that this will 
not be attempted for the Bayesian joint estimation analy- 
sis, which assumes a different PDF model in each case, and 
also that the CMB power spectrum is divided into different 
band powers, making such a combination unsuitable. We 
therefore only combine the map plane statistics. The com- 
binations used are listed in the first column of Tabled 

To produce an overall significance level for a given com- 
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Figure 3. Example of a non-normally distributed histogram of 
statistical values. Shown is the histogram for the kurtosis wavelet 
test applied to 1000 EGR, using the Haar wavelet basis. 

bination, we need to calculate the value of some test func- 
tion of the individual statistics included. An obvious choice 
would be a chi-squared test. However, many of the statis- 
tics used in this paper are drawn from non-normal distri- 
butions (see e.g. Fig. making a standard chi-squared 
test unsuitable. A non-normal version of the chi-squared 
has been derived jFerre ira et all Il998t) but as noted by 
iBarreiro fc Hobso nl feOOll) for the wavelet statistics in par- 
ticular, it is very hard to evaluate . We therefore follow the 
method of iMukheriee et alJ i2000T) and use a test function 
that is simply the number of individual statistics in the com- 
bination that show a 95% significant deviation from Gaus- 
sianity. The combined significance level is therefore the pro- 
portion of EGRs with test function values smaller than that 
obtained for the real data. It is noted that this method 
will not identify a single highly non-Gaussian statistic value. 
However, any such case will be self evident, and so this does 
not present a problem. 
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Figure 4. Three of the simulated-skies used in the simulations. Shown are the Gaussian CMB realisation (left), the 
pure cosmic strings map (middle) and the CMB/point sources (subtracted to 5m Jy) (right). 



4 ANALYSIS OF SIMULATED 
OBSERVATIONS 

In order to characterise the effectiveness of our chosen non- 
Gaussianity analyses in detecting various types of non- 
Gaussianity, we applied them to simulated VSA observa- 
tions of a number of underlying simulated-skies. The sim- 
ulated observations are produced in the same way as the 
EGR (see above), except that the underlying Fourier modes 
are provided by the Fourier transform of the simulated-sky. 
The simulated observations have the same relative pointing 
centres, visibility-plane sampling and rms noise levels as the 
six fields (three compact array and three extended array) 
comprising the VSA1 mosaic. Therefore, the simulations re- 
alistically sample the visibility plane and include realistic 
levels of Gaussian noise. 

We considered the following simulated-skies (see Fig. 

Gaussian CMB realisation. A Gaussian realisation of 
the CMB with a power spectrum given by the VSA best-fit 
cosmological parameters for the CDM cosmological model 
(see Paper VI). This is included principally to ensure that 
our non-Gaussianity tests do not produce spurious detec- 
tions. 

Pure cosmic strings. A map containing cosmic strings 
networks feouchet et aljll98ct) . The map is scaled to have a 
pixel rms equal to that of the Gaussian CMB map. It is noted 
that we only use a single realisation here. While it is clearly 
preferable to run any analysis on many different realisations, 
the lack of a large set of strings simulations prevents this. 
The analyses of this map do not therefore contain the effect 
of sample variance. 

Composite Gaussian/strings. The sum of the above 
Gaussian CMB and strings maps, rescaled to have the same 
rms as either individual map. 

Bright point sources/CMB. A power law distribution 
of point sources defined by the extrapolation to 34 GHz of 
15 GHz source counts made by the Ryle telescope (see Paper 
II for details). 

Source subtracted. The same point source distribution 
plus CMB as above, but with all sources above a given 
flux density removed (to mimic the VSA source subtraction 
strategy). Subtraction levels of 20mJy, 10m Jy and 5m Jy are 
considered. The VSA data analysed in this paper has a nom- 
inal source subtraction level of 20mJy. See Paper II for a 
more detailed discussion. 
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Figure 5. Example detection of the presence of bright points 
sources in the data using the joint estimation analysis. Shown 
is the marginalised posterior probability distribution calculated 
from simulated extended array data in an annular visibility plane 
band centred about I = 1085. The solid and dashed lines show 
the results with and without the bright point sources. 



4.1 Visibility plane simulations 

We applied the Bayesian joint estimation analysis to each of 
the 6 field centres individually. Computing constraints mean 
we are unable to use Monte Carlo techniques to characterise 
the test fully (each analysis takes several hours). We there- 
fore focus on unambiguous detections. 

Simulated observations of the Gaussian CMB field pro- 
duced no statistically significant detections. Moreover, this 
approach was found to be insensitive to the presence of cos- 
mic string networks in both the composite and pure strings 
simulations. However, clear detections were obtained in the 
presence of bright point sources. The most significant detec- 
tion occurred in one extended array field where three sources 
were present, two with fluxes of order 200 mjy, and one of 
order 100 mjy (see Fig. |S] solid line) . 

Analysis of the same simulated data where all point 
sources above a flux of 20 mjy (the level of subtraction 
applied to the real extended array data) were removed pro- 
duced a marginal detection in the same extended array field. 

It is noted that in the visibility plane, information 
about the sky (in the form of Fourier modes) has been 
convolved by the aperture illumination function. Any vis- 
ibility is therefore a linear combination of several (possibly 
non-Gaussian) Fourier modes. This combination tends, be- 
cause of the central limit theorem, to lessen the amount of 
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Figure 6. Example test functions for simulations of various simulated-skies. The stars show the test function values for the EGR, and 
the vertical bars show the test function values for the simulations. Both are calculated from 1000 realisations. The rows of images show 
the results for pure strings (top row), composite strings (second row), 20m Jy (third row), 10m Jy (fourth row) and 5mjy (5th row) source 
subtraction levels. 



non-Gaussianity present. We expect this effect to impair the 
ability of the joint estimation analysis to detect cosmologi- 
cal non-Gaussian structure. It is, however, ideally suited to 
detect non-Gaussianity in the visibility plane which, by a 
similar argument, would be poorly detected by map plane 
statistics. It is therefore important to analyse both planes. 



4.2 Map plane simulations 

We made 1000 simulated observations of each sky, each with 
a different noise realisation. By calculating individual statis- 
tics and combining them into test functions for all 1000 sim- 
ulated observations, as well as for 1000 EGR, we produce 
distributions that show the power of any given statistic or 
test function to detect the presence of the simulated sky. 

We applied the map plane analyses to mosaic maps of 
the simulated data. See Fig. |S| for examples of results of 
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three of the tests, namely map cumulants, Minkowski func- 
tional: perimeter and the Daubechies-4 kurtosis wavelet test. 
We found that simulated observations of the Gaussian CMB 
field produced no statistically significant detections in any 
of our tests. 

The pure cosmic strings were detected by all three tests. 
In particular, the Minkowski functionals and wavelet test 
are clearly highly sensitive to the presence of pure cosmic 
strings. In contrast, none of the tests were sensitive to the 
presence of the composite strings/Gaussian CMB simulated- 
sky. 

All of the tests were highly sensitive to the presence 
of bright point sources, and were able to detect the pres- 
ence of a source distribution up to 20 mjy. At 10 mjy, the 
wavelet test is still marginally sensitive to their presence, 
even though the other two tests are no longer sufficiently 
sensitive. At 5 mjy, all the tests are insensitive to the pres- 
ence of sources. 



5 ANALYSIS OF VSA OBSERVATIONS 

We applied all of our chosen non-Gaussianity tests to the 
VSA observations. The joint estimation test is applied sep- 
arately to the data for each of the 17 individual fields. The 
map plane tests were applied separately to each of the 3 
mosaiced fields. 



5.1 Visibility plane results 

For all 17 individual pointing centres, no statistically signif- 
icant deviations from Gaussianity were detected. 

During the early stages of the reduction of the com- 
pact array fields, this technique detected significant non- 
Gaussianity in the data from 3 fields. Further investigation 
showed that these detections were caused by a small num- 
ber of high amplitude visibilities in the data (no more than 
5 in each case, and as few as a single point). In all cases, 
they were unambiguously traced to residual spurious signal 
(see Paper I) and removed. Fig. |7| shows an example of the 
likelihood functions obtained before and after the removal 
of a single contaminated point. 

While the basic VSA data reduction procedure has been 
demonstrated to deal more than adequately with the spu- 
rious signal, this technique has served as an excellent addi- 
tional diagnostic of the VSA data, saving considerable time 
and effort during data reduction. It is now used as a stan- 
dard part of the VSA data reduction process. 

5.2 Map plane results 

The combined test function values and significance levels 
are given in Table Q The highest significance obtained was 
96.7% , in the VSA1 mosaic, from the combined wavelet 
kurtosis statistics, using the Daubechies-4 basis. The highest 
significances in the VSA2 and VSA3 mosaics were 87.3% 
and 74.8%, using the area Minkowski functional and the 
Daubechies-4 kurtosis wavelet test, respectively. Figs. |H| [5] 
and 1101 show example EGR test functions for the VSA1, 
VSA2 and VSA3 mosaics. In each case, these functions are 
determined from the 1000 EGR calculated for each data set. 



Marginalised likelihood function 




-0.6 -0.4 -0.2 0.0 0.2 0.4 0.6 

nor-Goussion parameter, a 

Figure 7. Example marginalised posterior probability distribu- 
tion obtained from analysing one of the compact array fields dur- 
ing data reduction using the joint estimation analysis. The data 
were subsequently discovered to be contaminated by residual spu- 
rious signal in a single binned visibility. The annular bin analysed 
here contained 47 binned visibilities. The solid and dashed curves 
show the data with and without the contamination. 

The value of the test function obtained for the real data is 
shown as the solid vertical line. 

While the 96.7% detection in the VSA1 mosaic hints at 
the presence of a source of non-Gaussianity, the significance 
level is not high enough to rule out Gaussianity. Moreover, 
it is noteworthy that simulations showed the Daubechies-4 
kurtosis wavelet test in particular to be sensitive to realistic 
point source distributions below the VSA's source subtrac- 
tion level. Therefore, this detection may be the result of 
slight point source contamination. Another point of note is 
that the VSA1 mosaic is predicted to have almost double 
the Galactic foreground contamination of the other two mo- 
saics (5.5 [iK, as opposed to 2.7 fiK in the VSA3 mosaic; 
see Paper II for details). Whilst this is not a major con- 
taminant, the Daubechies-4 kurtosis wavelet test may be 
sensitive enough to detect it. The addition of further ex- 
tended array data to this data set will significantly improve 
our sensitivity to non-Gaussianity and may enable us to ei- 
ther identify the source of the non-Gaussianity in the VSA1 
mosaic, or rule it out as a chance statistical fluctuation. 

That significant non-Gaussianity detections are not 
found in all three mosaics confirms the success of our source 
surveying and subtraction strategy. Simulations show that 
if the power law distribution of radio sources predicted from 
our source surveying observations continues below 20 mjy, 
we would expect a detection in approximately one field in 
three using the Daubechies-4 kurtosis wavelet test - as in- 
deed we find. Note that the VSA power spectrum (see Paper 
V) includes a statistical correction for this predicted residual 
point source contamination. 



6 CONCLUSIONS 

We have analysed the VSA data sets presented in Papers V 
and VI for the presence of non-Gaussianity. We have found 
the following: 

• The VSA2 and VSA3 mosaics are consistent with Gaus- 
sianity. 
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Figure 8. VSA1 mosaic example results and Gaussian test functions. The Gaussian test functions are denoted by 
stars, and the value calculated from the actual data is shown by a vertical bar. 



Minkowski functional: perimeter 



Wavelet kurtosis (Daubechies-4 basis) 



Number of 95% detectio 



■ - 

10 20 00 40 50 60 

Nurrber of 95% detections 



Number of 95% detectio 



Figure 9. VSA2 mosaic example results and Gaussian test functions 
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Figure 10. VSA3 mosaic example results and Gaussian test functions 



• In the VSA1 mosaic, a 96.7% detection of non- 
Gaussianity was found with the kurtosis wavelet test (using 
the Daubechies-4 wavelet basis). We conclude that while this 
may hint at a source of non-Gaussianity, the level is consis- 
tent with that expected for the known residual point source 
contamination in the VSA data. It is therefore unlikely to 
be cosmological in origin. 

• Non-Gaussianity testing in the visibility plane has 
proven an invaluable tool for locating systematic effects in 
data. It is therefore vitally important to test interferometric 
observations of the CMB for non-Gaussianity in both the 
visibility and map planes. 
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Table 1. Test function values and combined significance levels for the different cases of each map plane non- 
Gaussianity test. The results for each of the three VSA mosaics are given. 
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